Q-Value Based Particle Swarm Optimization for Reinforcement Neuro-Fuzzy System Design

Authors

  • Yi-Chang Cheng
  • Sheng-Fuu Lin
  • Chi-Yao Hsu
Abstract

This paper proposes combining particle swarm optimization (PSO) with a Q-value based safe reinforcement learning scheme for neuro-fuzzy systems (NFS). The proposed Q-value based particle swarm optimization (QPSO) equips PSO-based NFS with reinforcement learning; that is, it gives PSO-based NFS a way to learn optimal control policies in environments where only weak reinforcement signals are available. The reinforcement learning scheme is designed using Lyapunov principles and offers several practical benefits, including efficient learning and the ability to keep the system's state within a desired operating range. In QPSO, the parameters of an NFS are encoded in a particle that is evaluated by its Q-value. The Q-value accumulates the reward received during a learning trial and serves as the fitness function for PSO evolution. During the trial, one particle is selected from the swarm, and the corresponding NFS is built and applied to the environment, which returns an immediate feedback reward. The applicability of QPSO is demonstrated through simulations of single-link and double-link inverted pendulum systems.

Keywords: Neuro-fuzzy system, particle swarm optimization, reinforcement learning, Q-learning.
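The idea of using a cumulative reward as the PSO fitness can be illustrated with a minimal sketch. It is not the authors' implementation: a two-gain linear feedback law stands in for the neuro-fuzzy system, the Lyapunov-based safety mechanism is omitted, and the names `trial_return` and `pso`, the pendulum constants, the discount factor, and the PSO coefficients are all assumptions chosen for illustration. The only reward is a weak failure signal (-1 when the pendulum leaves the allowed range), discounted over the trial.

```python
import math
import random


def trial_return(params, steps=200, gamma=0.95):
    """Run one trial and return the discounted cumulative reward (the 'Q-value').

    Assumption: a linear feedback law u = -(k1*theta + k2*omega) stands in
    for the neuro-fuzzy controller encoded by a particle.
    """
    theta, omega = 0.1, 0.0   # initial angle (rad) and angular velocity
    dt = 0.02                 # integration step (s), hypothetical
    q = 0.0
    for t in range(steps):
        u = -(params[0] * theta + params[1] * omega)
        alpha = 9.8 * math.sin(theta) + u   # unit mass and length assumed
        omega += alpha * dt
        theta += omega * dt
        if abs(theta) > 0.5:                # left the desired operating range
            q += (gamma ** t) * (-1.0)      # weak reinforcement: failure only
            break
    return q


def pso(fitness, dim=2, swarm=10, iters=30, seed=0):
    """Plain global-best PSO that maximises `fitness`."""
    rng = random.Random(seed)
    pos = [[rng.uniform(0.0, 30.0) for _ in range(dim)] for _ in range(swarm)]
    vel = [[0.0] * dim for _ in range(swarm)]
    pbest = [p[:] for p in pos]
    pbest_fit = [fitness(p) for p in pos]
    g = max(range(swarm), key=lambda i: pbest_fit[i])
    gbest, gbest_fit = pbest[g][:], pbest_fit[g]
    w, c1, c2 = 0.7, 1.5, 1.5               # inertia and acceleration weights
    for _ in range(iters):
        for i in range(swarm):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                pos[i][d] += vel[i][d]
            f = fitness(pos[i])             # one trial per particle evaluation
            if f > pbest_fit[i]:
                pbest[i], pbest_fit[i] = pos[i][:], f
                if f > gbest_fit:
                    gbest, gbest_fit = pos[i][:], f
    return gbest, gbest_fit


if __name__ == "__main__":
    best, best_fit = pso(trial_return)
    print("best gains:", best, "fitness:", best_fit)
```

Under these assumptions the fitness lies in [-1, 0]: a controller that never fails scores 0, and one that fails later scores closer to 0 than one that fails early, so PSO is pushed toward longer-surviving controllers even though the reward signal is sparse.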

Similar resources

Optimization and design of Adaptive Neuro-Fuzzy Inference System using Particle Swarm Optimization and Fuzzy C-Means Clustering to predict the scour after bucket spillway

Additionally, if the materials downstream of a bucket spillway are erodible, the ogee spillway is likely to overturn over time. Therefore, predicting the scour after a bucket spillway is important. In this study, the scour depths downstream of a bucket spillway are modeled using a new meta-heuristic model. This model is developed by combination of the Adaptive Neuro-Fuzzy Infere...

Online Control of Nonlinear Systems using Neuro-Fuzzy Design tuned with Cooperative Particle Sub-Swarms Optimization

This paper proposes a TSK-type Neuro-Fuzzy system tuned with a novel learning algorithm. The proposed algorithm uses an improved version of the standard Particle Swarm Optimization algorithm that employs several sub-swarms to explore the search space more efficiently. Each particle in a sub-swarm corrects its position based on the best other positions, and the useful information is exchanged amon...

ADAPTIVE NEURO-FUZZY INFERENCE SYSTEM OPTIMIZATION USING PSO FOR PREDICTING SEDIMENT TRANSPORT IN SEWERS

The flow in sewers is a complete three-phase flow (air, water and sediment). The mechanism of sediment transport in sewers is very important. In other words, the passing flow must be able to wash away deposited sediments, and the design should be economical and optimized. In this study, the sediment transport process in sewers is simulated using a hybrid model. In other words, using the Ada...

Adaptive Neuro-Fuzzy Control Approach Based on Particle Swarm Optimization

This paper proposes a modified particle swarm optimization algorithm (MPSO) to design adaptive neuro-fuzzy controller parameters for controlling the behavior of non-linear dynamical systems. The modification of the proposed algorithm includes adding adaptive weights to the swarm optimization algorithm, which introduces a new update. The proposed MPSO algorithm uses a minimum velocity threshold ...

Design of a New IPFC-Based Damping Neurocontrol for Enhancing Stability of a Power System Using Particle Swarm Optimization

The interline power flow controller (IPFC) is a concept of the FACTS controller for series compensation which can inject a voltage with controllable magnitude and phase angle among multiple lines. This paper proposes a novel IPFC-based damping neuro-control scheme using PSO for damping oscillations in a power system to improve power system stability. The addition of a supplementary controll...

Publication date: 2011